Flex processing
Flex processing provides significantly lower costs for Chat Completions or Responses requests in exchange for slower response times and occasional resource unavailability.
It is ideal for non-production or lower-priority tasks such as model evaluations, data enrichment, or asynchronous workloads.
Set the service_tier parameter to flex in your API request (Chat or Responses) to take advantage of Flex processing.